Environment Independent Speech Recognition System using MFCC (Mel-frequency cepstral coefficient)
نویسندگان
چکیده
Speech recognition is a method of finding similarity between two sequences. Various researches have been done on it. In our research, we are trying to achieve the optimal accuracy during the recognition procedure. Here, we are extracting features of the voice sample before filtering it through a noise reduction filter. For each individual, there are number of features are taken using feature extraction algorithm called mel frequency cepstral co-efficient (MFCC). After extracting the features of all the training samples, we have taken the average. Now extracting the features of the testing sample, similarities are calculated using dynamic time warping . We are comparing the results of frame error, word error of the previous work with our research. Keywords— Novel method, filtering, MFCC, DTW
منابع مشابه
Language and Text-Independent Speaker Identification System Using GMM
This paper motivates the use of Dynamic Mel-Frequency Cepstral Coefficient (DMFCC) feature and combination of DMFCC and MFCC features for robust language and text-independent speaker identification. MFCC feature, modeled on the human auditory system has been the widely used feature for speaker recognition because of its less vulnerability to noise perturbation and little session variability. Bu...
متن کاملReal Time Speech Recognition Using DSK TMS320C6713
Speech recognition is an important field of digital signal processing. Automatic Speaker Recognition (ASR) objective is to extract features, characterize and recognize speaker. Mel Frequency Cepstral Coefficients (MFCC) is most widely used feature vector for ASR. MFCC is used for designing a text dependent speaker identification system. In this paper the DSP processor TMS320C6713 with Code Comp...
متن کاملMFCC and Prosodic Feature Extraction Techniques:
In this paper our main aim to provide the difference between cepstral and non-cepstral feature extraction techniques. Here we try to cover-up most of the comparative features of Mel Frequency Cepstral Coefficient and prosodic features. In speaker recognition, there are two type of techniques are available for feature extraction: Short-term features i.e. Mel Frequency Cepstral Coefficient (MFCC)...
متن کاملText-Dependent Multilingual Speaker Identification using Back Propagation Neural Network and PSO-GA Hybrid Model
In this work a multilingual speaker identification system is proposed. The feature extraction techniques employed in the system extract Mel frequency cepstral coefficient (MFCC), delta mel frequency cepstral coefficient (DMFCC) and format frequency. The feature selection is done using hybrid model of particle swarm optimizatiom (PSO) and Genetic Algorithm (GA). We have used Back Propagation (BP...
متن کاملPerceptual Significance of Cepstral Distortion Measures in Digital Speech Processing
Currently, one of the most widely used distance measures in speech and speaker recognition is the Euclidean distance between mel frequency cepstral coefficients (MFCC). MFCCs are based on filter bank algorithm whose filters are equally spaced on a perceptually motivated mel frequency scale. The value of mel cepstral vector, as well as the properties of the corresponding cepstral distance, are d...
متن کامل